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INTELLIGENT HARVESTING AND screen size, the presentation of information on any one 

NAVIGATION SYSTEM AND METHOD device is not optimized, for example, for its screen size due 

to the variety of formatting alternatives. 
This application claims benefit of provisional application Most of the prior approaches to wireless content delivery 
Ser. No. 60/160,801 filed Oct 21, 1999. 5 have involved linking a certain browser with a certain 

protocol and a certain mark-up language. There is not a 
BACKGROUND OF THE INVENTION ^ standard ^ fe me multitude of 

This invention relates generally to a system and method wireless handheld devices and information appliances. As a 

for delivering content to information appliances and in consequence of this lack of a standard, content providers are 

particular to a system and method for permitting web pages 10 forced to re-author and re-format their web pages in order to 

in different formats to be communicated to various different generate content for each of these devices, 

information appliances having different display footprints. Compatible languages, such as Extensible Markup Lan- 

Today, people have an unquenchable thirst for informa- guage (XML), a software language designed especially for 

tion that demands instant access at any time or place to the WeD documents, have become much more mature and 

information. There has been explosive growth in the hand- 1 permit re-formatting of HTML or XML web pages on-tbe- 

held computing market and in the cell phone market with Ay to formats that individual devices can utilize. However, 

over 300 million users worldwide. In addition, there has none of these conventional systems and solutions provide a 

been a continued increase in the number of people with single unified system that permits web pages having differ- 

access to the Internet. These three different markets will 2Q cnt formats and mark-up languages to be delivered to 

soon converge as cell phones and handheld computers will different information appliances that may use different 

have web browsers ("mini browsers") integrated therein. To protocols, different browsers, or have different input/output 

promote the convergence, cell phone manufacturers and formats (e.g., different screen sizes). Thus, none of the 

wireless data network providers have attempted to standard- conventional systems provide an intelligent navigation sys- 

ize Internet content distribution with the wireless application & m wherein content may be delivered in a customized 

protocol (WAP). Internet content providers have been slow 25 manner to the different information appliances, 

to adopt these standards and now several major device Another problem with the conventional systems is that 

manufacturers have begun to create their own proprietary content providers have not been able to control the "look and 

standards. feel" of their site using these other solutions so that the site 

Two factors hinder the extension of the Web and is content ^ may look very different on different devices. It is desirable, 

from the personal computer (PC) environment with fairly however, to provide a system that will allow these sites to 

standard display formats to the non-PC based information customize the presentation of their site web pages to the 

appliances and devices. First, re-purposing and converting wide variety of information appliances. Thus, it is desirable 

existing PC-centric HTML web sites to the new breed of to provide a content delivery system and method that solves 

information appliances with drastically varying screen sizes 35 the above limitations and problems with the conventional 

is very problematic. For example, it is not appropriate to put systems and it is to this end that the present invention is 

the content of a PC-centric web site onto a small smart directed, 
phone screen linearly. An intelligent navigation scheme that 
automatically converts content intended for the PC into 

content applicable to one or more different information ^ The content delivery system and method in accordance 

appliances is needed. This intelligent navigation scheme with the invention solves tie above problems and limitations 

would optimally vary its output based on the screen size of with conventional systems and solutions by providing a 

the particular information appliance. Second, most current system and method that delivers Web-based content, 

wireless content delivery solutions demand adherence to commerce, enabling transactions, and services to a variety of 

proprietary browsers, proprietary mark-up languages and/or 45 information appliances and devices without requiring the 

proprietary protocols. re-authoring of the content information for display on each 

As FIG. 1 illustrates, there are currently multiple different of these different devices, 

mark-up languages 2, multiple different protocols 3 and ■ ^Id accordance with the invention, the system and method ^ 

different browsers 4. For example, Phone.com has intro- 'permits content to be input into the system in a variety of 

duced both the HDML and the WML- protocols for cellular 50 different formatting languages. In addition, the system per- 

telephones in the United States, whereas Japan has adopted mits the formatted content to be output in any mark-up 

the I -mode protocol. Palm Pilot devices support a variant of language and protocol, such as WML, HTML, HDML, 

the HTML protocol that uses web-clipping, while Windows XML, etc. Advantageously, each display page on the device 

CE devices support only a limited HTML protocol using may be customized. To organize the content for display on 

special software such as Pocket Explorer. To establish an 55 the devices, the received content information may be 

effective wireless presence, a company with content must mapped into a hierarchy of groups so that the content 

support the multitude of different information appliances 5, information can be optimally formatted for display on the 

the different protocols 3, the different markup languages 2 devices according to the input/ou,tput format, such as the 

and the different browsers 4. display^screen^e^aiameters of the devices. J 

The number of devices, protocols and mark-up languages 60 In more detail, the method for content delivery may 

create a large matrix of different combinations of devices, include intelligently harvesting content from a web page to 

protocols and languages wherein each combination requires provide that content to a plurality of different information 



SUMMARY OF THE INVENTION 



a different web server. The rewriting of a site for each 
mark-up language and for interfacing with each different 
protocol and screen size is expensive, complicated and time 
consuming. In addition, because each different device may 
have a different input/output format, such as a different 



appliances having different screen sizes. The intelligent^ 



harvesting mav convert the content into a^propnetarv rela- 
tional markup language (RML) and generate a tree an d then 
a document object model from the RML content. The tree 
may then be analyzed and searched using a set of processing 
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rules in order to generate content screens customized to eacb information appliances having different display formats and 

information appliance. Atypical card builder may build the different screen sizes and it is in this context that the 

card corresponding to the customized content and a typical invention will be described. It will be appreciated, however, 

deck builder may build a deck of cards corresponding to the that the system and method in accordance with the invention 

one or more display screens that make up the content for the 5 has greater utility such as to systems for delivering other 

particular information appliance. The deck of cards may types of content to devices having different input/output 

then be converted into a presentation format and protocol for formats. 

the particular information appliance and sent to that infor- The content delivery system in accordance with the 

mation appliance. invention provides many advantages over the conventional 

BRIEF DESCRIPTION OF THE DRAWINGS 10 systems. As described below, the system permits content in 

i . .„ t . , . t a variety of different formats, such as HTML, XML, raw 

FIG 1 is a diagram lUustrating examples of different J be m ^ ^ and men ^ the 

conventional wireless content ^delivery systems; ^nt to be output in a variety of different output formats 

FIG. 2 is a block diagram 1 uTustraung a content delivery ^ tocols ^ ^ „ wml, HTML, HDML, XML, etc so 

system in accordance with the invention; ^ moomin amtenl te displayed on many 

FIG. 3 is a Actional diagram illustratmg a content different appliances and devices having differ- 

debvery system m accordance with the invention; cntscrccn ^ m a pr ; fcrrcd embodimcntj thc system may 

FIG. 4 is a diagram Olustraung the translaUon server of XiStmXML daU structurc ^ me system may include one 

the content delivery system shown m FIG. 2; or more different software apphcations that maybebasedon 

FIG. 5 is a block diagram illustrating an embodiment of me j A y^ language 

the content delivery system of the invention. 20 . iL . L . 4 A . , , 

FIG. 6 is a diagrammatic view illustrating the cootent . In >«=ordance wtththe mvent.^ content providers do not 

connection bandleTof the translation server shown in FIG. to , ch J^ c **" <« ™ 

4. different web servers to send information to various infor- 

* FIG. 7 is a diagram illustrating an intelligent harvesting mation P 3 ^^ the system receives incom- 

method in accordance with the indention; 25 mg content on-the-fly from an Internet content provider 

FIG. 8 is a diagrammatic view illustrating the XML mcrcb y for informaaon ^neration In 

engine of the translation server shown in FI04; more d f me ***** m accordance with the invenUon 

^Tl a . . . c wmAT manipulates standard web pages into a relational markup 

FIG. 9 illustrates an example of the mapping of an HTML . r st%**t\ ^ * r u • c *• 

, . „ \. ^ . v. ... *u language (RML) that permits, for eacb information 

web page into an RML object in accordance with the 7* « J« r *i_ * • c 1 * *u 

j. & . J 30 appliance, a "ser of pages that is more useful to the 

71 . . . individual device and dependant on the device. Now, the 

nG.lOisamagrammabcviewof the layout engine of the ddiv system m accordance with the invention 

translation server shown m FIG. 4; wil i ^ desC ribed. 

FIG. 11 ilhistrates an example of a portion of an HTML- ( / nG 2 ^ a & illustrating a ^tent delivery system 

based web page from the E-TRADE website illustratmg the 35 ^ ^ m mc ^ a system n ffi accor dance 

grouping of different elements of the web page in accor- ^ ^ m translation system 12 may be any 

dance with the mvention; computer system, such as a server or workstation, with 

FIG. 12 is a diagrammatic view of a data structure tree for sufficieot computing power to handle the functions being 

ordering the different groups and atomics of the E-TRADE performed as described below. In a preferred embodiment, 

web page shown in FIG. 11; 40 the translation system 12 may be a server computer that 

FIG. 13 is a diagram illustrating a recursive tree analysis stores one or more different software applications that may 

in accordance with a preferred embodiment of the invention; b e executed by the gPUof the server in order to implement 

FIG. 14 is a diagrammatic view of the tree structure the functions of the content deliyejy,~£Yslem„ described 

shown in FIG. 12 illustrating a collapsing methodology for below. The translation server 12 may allow content provid- 

p recessing the tree in order to create cards in accordance 4S C rs 13 to deliver their content (in different formats as shown) 

with the invention; to one or more different information appliances 15 without 

FIG. 15 is a diagrammatic view illustrating an example of needing to reformat, re-author or rebuild an existing web site 

card formats that are created in processing the tree structure in order to deliver it to the different information appliances 

shown in FIG. 12; 15 using different communications formats as shown. It is 

FIG. 16 is an example of a portion of an HTML-based 50 desirable to provide an upwardly scalable robust server 12 

web page from the CitySearch.com website illustrating the so that software can be coded without requiring updates as 

grouping of different elements of the web page in accor- server loads increase. Rather, such increased server loads 

dance with the invention; may be handled by adding additional hardware compoDents, 

FIG. 17 is a diagrammatic view of a data structure tree for for example memory ^to the server. It is also desirable ttiat 

ordering the different groupings and atomics of the City- 55 the server 12 be platform independent to support any oper- 

Search.com web page shown in FIG. 16; a ting system environment, such as UNIX, Windows, Macin- 

FIG. ISA is an example of a screen shot of a Palm Pilot tosh ^ aQ y other operating system, 

device showing a presentation page of the CitySearch.com L In more detail, the translation server 12 may take infor- 

website shown in FIG. 16; and mation directly from an Internet content provider's web site 

FIG. 18B is an example of a series of screen shots of a 60 m vario us forms, such as HTM_L data, XMLJata, ,o£raw.data. 

cellular telephone device showing a presentation page of the fee&an^thg^^ it^a the to^ation^rvexl2.aad 

atySearch.com website shown in FIG. 16. ^oughajteje^^ 

carrier base station that uses a typical^rnmiiniqations 

DETAILED DESCRIPTION OF A PREFERRED fora^f^^ 

EMBODIMENT 6S format Jfeat is complete ly., custoaujed^tp ttil end juser's 

The invention is particularly applicable to system and deyjcg t yp e and browsing, cap abilities.. Thus, the content 

method for delivering web content to a variety of different delryery^system and method may generate and output WML, 
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HDML, tiny HTML, compact HTML, HTML or XML data provides the tree structure needed by the layout engine as 

that is compatible with the particular information appliance described below during the tree analysis functions. 

15. The information appliances 15 may be any type of device Generally, a tree data structure is a method for representing 

including WAP compliant cell phones, Windows CE a hierarchy of data using tree diagrams formed from nodes 

devices, Palm OS devices, and any other HTML browser 5 and line segments between the nodes. This may be a bit 

based devices, confusing because the DOM's tree structure may be used for 

A For each wireless device 13, there may be a separate both its intended purpose of storing the HTML markup 

' telecom system 14, such as different gateways or proxies to contained in atomics, but also as a way of storing relational 

the different information appliances 15. Thus, in accordance information about those atomics as described below in more 

with the invention, the content provider 13 may create a 10 detaiL 

single piece of content in a single format and the telecom V* The tree analysis function 30 may receive the DOM and 

system 14 does not have to be modified in order to provide dynamicaUy generate pages for a variety of different 

the wirelesscontentinaccordancewith the invention toeach sizcs - £ morc » u typical well known 

separate wireless device 15. In addition, each Internet con- ^ch algonthm, such as that described in 

lent provider 13 also has the ability to customize the 15 ^<*C ^nary of Computer Sc m *8P^ 

p _ . . , . / - ■ , , 13 Technology, is used to recurse through the atomics and 

appearance of their web ate pages using the system s web ^ complicated rules as described below for opti- 
based GUI tool. The GUI tool allows a web pace producer ^ ^ a ny filling pages with content while creating an intelligent 



based GUI tool. The GUI t ool allow s JLSKsLpage producer 
or_fewl©per_JP_grcJ2^^ 
translation server 12 to describe how a particular web page 
is to be translated by the system 1©. In particular, the "drag 
and drop" functionality of the GUI tool in a toolkit that 
allows a "producer" level employee to tailor the "look and 
feel" of the site so that, when the web site is delivered to a 
wireless device 15 via the system 10, the same "look mad 



navigation scheme in accordance with the invention. The 
Card Builder function 32 may apply the protocol specific 
syntax needed by the particular target device and output a 
presentation card 34. In accordance with the WAP protocol, 
the Deck Builder 36 is needed to put the cards into decks and 
into a presentation shoe 38 to optimize transmission latency 
through wireless networks. The system also permits the 



feel" can be delivered as well. Now, more details of the^ intelligent interaction of the information appliance with the 

wireless content delivery system will be described. web site since the system may enable functionality not 

FIG. 3 is a functional diagram of the content delivery typically supported by the information appliance via the 

system and in particular the translation server 12 in accor- virtual browser. The functionality of the content delivery 

dance with the invention. In the example shown, the content system will be described in more detail below. Now, a 

20 may be an HTML web page. The translation server 12 30 prcferredLhara^ 

may include an intelligent harvester 22, a tree synthesizer system m accordance^ 

26, a tree analyzer 30, a card builder 32 and a deck builder FIG. 4 is diagram illustrating a preferred implementation 

38 that generate a presentation shoe 38 that may be sent to of the translation server 12 of the content delivery system 10 

a particular information appliance 15. To intelligently har- in accordance with the invention. In particular, the server 12 

vest an HTML web page not only involves grabbing the 35 may include a conlenLQonnection handjer^jKj, a layout 

content on the site (scraping), but also allows any function- engine 42, an appliance connection handler 44 and an XML 

ality on that site to be enabled on the target information engine 46. The translation server 12 may also include a 

appliance or device. This enabled functionality may include, database 47 that may contain XSL rules used by the XML 

for example, forms, transactions, javascript, cookies, session engine 46 for^corryjutin^ one or 

data and security measures. This enabled functionality is 40 more URL Ids and various device information. In accor- 

possible due to the virtual browser (See FIG. 7) that dance with the invention, each XSL rule may be indexed in 

provides, for example, javascript and cookie proxy engines, the database based on an ID (the ID may contain a URL, a 

so that an information appliance that cannot support javas- name/value pair and cookie information) so that the system 

cript may do so with the javascript being eexecuted on the may determine which rule applies to which incoming URL. 

translation server. As another example, an information appli- 45 Hie device information may be used by the layout engine 42 

ance may not support persistent session that may be required in order to convert the RML data into one or more cards in 

by the web site, but the virtual browser may enable a a deck that may be displayed on the particular device. For 

persistent session so that the web site thinks that it is example, the information may indicate the amount of infor- 

interacting with an information appliance that has the per- mation that may be fit on each screen. The translation server 

sistent session capability. This process also involves apply- 50 12 may also include a long term database 48 that may 

ing a set of rules describing the relational context of the contain cookies so that the system knows which pages have 

content (e.g., how a piece of, for example HTML code, been processed previously and a session database 50 that 

relate to each other). stores a presentation shoe for each device. 

In more detail, the intelligent harvester 22 may receive the Web page requests made by an information appliance 13 

content and generate a relational data structure 24 that 55 to an Internet content provider 13 may occur using, tor 

corresponds to the content as described below in more detail. example the Wireless Application Protocol (WAP), a wire- 

The data structure containing the content in a relational less data transmission standard described in the WAP 1.2 

format in accordance with a preferred embodiment of the Specification Suite located at http://www.wapforum.org, 

invention is a proprietary relational markup language known incorporated herein by reference. Generally, in accordance 

as RML. RML is an XML based language which has the 60 with the WAP protocol, an information appliance 15 may 

advantage of permitting the easy mapping of the content into initiate a request in WML (Wireless Markup Language), a 

a tree structure by the tree synthesizer 26 so that the tree language derived from XML especially for wireless network 

synthesizer may output a typical document object model characteristics. This request is passed to a WAP gateway that 

(DOM) 28. The DOM is a common object model used to then retrieves the information from the Internet. The 

manipulate markup such as HTML such as it disclosed on 65 requested information is then sent from the WAP gateway to 

the W3C web site at http://www.wc3.org. Although it is the WAP client (the device 15), using whatever mobile 

typically used for manipulating HTML or XML, it also to network bearer service is available and most appropriate. 
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The data transmitted over the telecommunications net- 
work 14 (See FIG. 2) is represented as a group of data 
packets and electronically routed over the telecommunica- 
tions network 14 to an appropriate destination node, such as 
an appropriate information appliance 15. The packets of data 
include header information that describes the information 
contained in the data packets, such as the type of data 
contained in the packet, Le. HTML, voice, ASCII, etc., and 
origination and destination node information, and is used by 
the system 10 to route and configure the data packets for 
transmission through the wireless network 14, according to 
well known network routing techniques. 

In accordance with the invention, when an information 
appliance 15 requests a web page from the content provider 
13, the content provider 13 examines the header information 
contained in the data packets and redirects any non-PC 
requests to the translation server 12. In accordance with the 
invention, content from an Internet content provider 13 is 
received by the content connection handler 40. The content 
connection handler 40 mimics a standard HTML browser, 
such as Internet Explorer or Netscape, and functions as the 
interface with a content provider's web site. Non-PC 
requests are redirected to the transl ation server 12 so that the 
web page information can be translated into a data format 
appropriate for and recognizable by the destination infor- 
mation appliance 13. Desktop PC requests do not need to be 
redirected to the translation server 12 since these devices are 
a ble to display web pages instandard HTML or other similar 
formats and do notrequire customuc£^^^^€l^oTm^ 
udnT 

^~The appliance connection hanp^rjW^peratejs^asa Web 
server for. a guesting informatioQ appliance 15, JTbe appli- 
ance connection handler 44 brokers and controls me entire 
transaction between the rcque^ 15 and the trans- 

lation server 12. For example, the appliance connection 
handler 44 may handle the session with each telecom system 
14 or wireless device 15 and may perform various functions 
such as establish a session with the information appliance or 
wireless device 13, retrieye_page jrrfqrmajti^ 
content connectipn_to 
using the X^ 

described below, and transmit the translated page informa- 
tion to a requesting information device 15 with appropriate 
header information. The appliance connection handler 44 
may also determine state information of the devices 15, 
synchronize the devices 15, determine browser and protocol 
information and perform various security operations. 

The XML engine 46 converts the XHTML page, gener-' 
ated j>y the content conne c t ion handle£3S^to..a„prormetary 
marjojpjknguage, RML-^tfae Rela tionaTMark ujjLaDguage, 
usingjmil^se Uha^ st ored in a database4M0ffiris 
a "markup language written irrXMLrE5cn~pa]^type may, 
include 1) a XSL rule-set that specifies what pieces of 
content to display on vary appliances as well as the relational 
structure amongst those pieces of content; and 2) a_perl 
script. XSL is a language for transforming anjfflh^ dbau: 
m eat into anotner XML document as described at the W3C 
website located at JUtPj//www,w.3c.Qr^ The XSL rulesets 
used by the XML engine 46 may be stored in a database 47 
indexed by ID page-type identifiers. The XSL rule-set may 
be used to process hierarchical elements of the content while 
the perl script may be used to process unstructured markup 
in the content. In accordance with the invention, a unstruc- 
tured tag may be used to mark the markup that may be 
processed using the perl script. Each page-type has a rule-set 
and possibly a perl script associated with it. In accordance 
with the invention, the XSL rule-set and the perl script may 
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be generated manually or may be automatically generated. 
The information stored in the database 47 can be used by the 
XML engine 46 when converting page information. 
1^ The layout engine 42 processes the content to convert the 
relati onal RML conten t, received from the XML engine 46, 
muTflevice~and protocol specific mark-up language formats. 
Formatted output provided by the layout engine 42 will be 
referred to herein as a "presentation shoe." For purposes of 
this description, a presentation shoe consists of "decks'* and 
"cards." A deck is the smallest unit of content that is sent to 
a device. In certain protocols, such as WML and HDML, a 
deck can contain multiple cards. The presentation shoe thus 
contains the original HTML content of a webpage that is 
reformatted into an appropriate format language and tar- 
geted at an information appliance 15. The presentation shoe 
is formatted specifically for the target appliance's screen 
size, user interface and protocol. Advantageously, t^_pxs.- 
senudon^oe_Kjcreate^ynamKally in accordance with 
relagi^]^ will be 

described in detail herein. Information about the presenta- 
tion shoe can be stored in a session database 50 that may be 
in communication with the layout engine 42. Now, the 
details of the operation of the translation server will be 
described. •** 
FIG. 3 is a block diagram illustrating operation of the 
wireless content delivery system 10 of the invention. As 
described above and shown in FIG. 5, an information 
appliance 15 may request page information of a particular 
URL website from a content provider 13. The request by the 
information appliance 15 may be redirected, for example, 
via a JAVA servlet 60, to the translation server 12. The 
appliance connection handler 44 examines header informa- 
tion from the requesting data in order to determine a target 
device 15, protocol and browser configuration. The appli- 
ance connection handler 44 then requests the desired URL 
information from a content connection handler 40. The 
appliance connection handler 44 may relay information 
about the requesting wireless device 15 to the content 
connection handler 40 such as a requested URL address of 
a content provider 13, a session ID, a user ID and a wireless 
device's browser capabilities. The content connection han- 
dler 40 retrieves the requested information from the content 
provider 13, renders any JavaScript, stores any cookies and 
returns the requested information as XHTML data to the 
appliance connection handler 44. 

The appliance connection handler 44 then requests the 
XML engine 46 to convert the received XHTML data to 
RML data and assign atomics into the relational structure 
according to page type so that presentation cards can be 
created and placed in a presentation shoe so that the cards 
can be transmitted to the target device 13. The XML engine 
46 will be described in more detail below with reference to 
FIGS. 7-9. 

The application connection handler 44 then conveys the 
RML data output from the XML engine and device infor- 
mation (stored in the database 47 shown in FIG. 4) to the 
layout engine 42 so that the layout engine 42 can generate 
a device and protocol specific set of cards that are served to 
the requesting appliance 15 by the appliance connection 
handler 44, via the presentation shoe. 

In more detail, the layout engine 42 may include a layout 
processor 62, a preprocessor 64, a recursenode module 66, 
a card formatter 68, a deckbuilder 70, a contentcutter 72, a 
ordernodes module 74, a guidehandler 76, a recursenode and 
recurse atomic module 78, 80, a navigation builder 82 and 
a shoebuilder 84. The layout processor 62 may receive the 
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XML DOM, the device information from the database 47 appliance. This function is performed by an HTTP client 
and the shoe ID and may forward the shoe information to a connection module 90. The HTTP client connection module 
preprocessor 64. The preprocessor may forward the XML 90 functions to maintain a communication session with the 
DOM and class information to a content cutter 72 that content provider 13 and handle cookie and security (SSL) 
operates on the RML data to remove any classes of infer- 5 information. An HTML data processor 92 may render Java- 
mation that cannot be displayed on the particular infbrma- Script data in a proxy fashion for the requesting information 
tion appliance or wireless device 15 based on the device appliance 15. With a scripting language, such as JavaScript, 
information from the database 47. Once the classes are the client-side source code is embedded directly into the 
removed, an ordered structure of atomics is generated from HTML page and a client-side software plug-in that interprets 
the RML data by the OrderNodes module 74 so that, for 1Q that language is automatically activated while the HTML 
example, a tree structure describing the relation of different page is being displayed. TTiis client-side software is not 
portions of the content information to be displayed on the available on most information appliances, requiring our 
device 15, can be created. A GuideHandler module 76 servers to proxy this function. An HTML cleanup module 94 
permits the system to put a template on each card of the first may generate XHTML page information so that the trans- 
card in a shoe delivered to an information appliance. For 1S lation server 12 can translate the received HTML page 
example, the logo of a company may be displayed in the information to a format readable by and appropriate for an 
comer of each page displayed to the information appliance. information appliance 15. Now, a method for intelligent 
The navigation builder 82 may permit links between cards harvesting in accordance with the invention will be 
and other items to be placed on a card. For example, a described that includes some of the elements described 
particular web page may always have one or more naviga- 2Q above. 

tion links on each page (e.g., support, products, etc.) that fig. 7 is a diagram illustrating a method for intelligent 

may be placed on each card by the navigation builder. harvesting of web pages in accordance with the invention 

The RecurseNode module 66 and 78 may then operate on that uses the elements shown in FIGS. 3 and 4. In particular, 
the tree structure to group related content information so that the content 20 is fed into a virtual browser 100 as described 
appropriate cards can be created for presentation on the ^ briefly above which passes on the content to the XML 
wireless device 15. The recursion of the nodes in the tree is engine 46. Using the particular page-type rules sets stored in 
described in more detail below with reference to FIG. 14. In the database 47, the XML engine 46 may convert the content 
processing the tree structure, the card formatter module 68 into relational markup language (RML) format 24. In more 
may generate presentation cards that include the appropri- detail, a session on the Internet is handled by the virtual 
ately grouped atomics as determined by the recursenodes 30 browser KM) located on the translation server 12. This virtual 
and recurseatomic modules 66, 78, 80. The deck builder browser provides the important functionality of proxying 
module 70 may then groups the plurality of presentation javascript (using a Javascript proxy engine 102) and cookies 
cards into decks so that the deck can be organized into an (using a cookie proxy engine 104) for the target devices. For 
appropriate presentation shoe to optimize the wireless link each page-type, an XSL rule-set is stored (See the W3C 
between the system and the information appliance. To 3S specification at http://www.w3.org/TR/2000/WD-xsl- 
perform the above function, the deckbuilder 70 may include 20000112) in the database 47. This rule-set contains infer- 
tile navigation builder 82 that builds the navigation path mation about the division of content into atomics, content 
between the cards in the deck and the shoebuilder 84 that classification, and the relationships amongst these atomics 
actually builds the presentation shoe. for the particular page. The XML Engine 46 uses these XSL 

Once the presentation shoe is completed by the layout 40 rule-sets to translate the HTML into RML. The division of 

engine 42, it returns the presentation shoe to the appliance content into atomics can be generated either automatically or 

connection handler 44. The appliance connection handler 44 by a producer using a tool set. The content classification 

then may relay the presentation shoe, for example via the information is used to determine what content and function- 

JAVA servlet 60 to the appropriate requesting wireless ality are appropriate for which classes of devices. The 

device 15 so that the content information from the content 45 content classification information may be generated either 

provider 13 can be displayed in a format appropriate for the manually or automatically. Now, more details of the XML 

wireless device 13. engine in accordance with the invention will be described. 

FIG. 6 is a diagrammatic view illustrating the function of FIG. 8 is a diagrammatic view illustrating more details of 

the content connection handler 40. The content connection the XML engine 46. As described above, the XML engine 46 

handler 40 establishes an on-line session with a web site in 50 e xtracts content from dyna mically changing XHTML. infpr- 

order to receive webpage content to be converted- In a mation and generates a corresgoli^ 

preferred embodiment, HTML pages are retrieved from a RMOyej^ XSL 

specified URL website and the retrieved HTML information rulesets defiiie the transforn^ used to convert 

is formatted as XHTML, an XML compliant HTML format between An j 

utilized by the translation server 12 and represented by a 55 example of an XHTML document Sg inverted into j 

document object model (DOM). A DOM is a common object RML using an XSL rulesets is described below, 

model used to manipulate markup. This structure is used to „g In operation, the XML engine 46 m ay receive a pag e-type^ 

translate the HTML page information to a format that can be ^designated by URL, name/value pairs, and i CQol^informa- , 

translated by the XML engine 46 to be described in detail tiQn a nd pages of XHTME^informaiion ^ from the content / 

below. The content connection handler 40 may communicate 60 connection handler^ 112 / 

with a long-term database 48 (See FIG. 4) that is configured mayl^ive Tc^ain XSLIrulesets from the database 47 that / 

to store cookie information about the web sites from which dergjfg^ ^ the URL wetenTis to be>> / 

page information is retrieved. coljyjjljg^^ 

As described above, the content connection handler 40 toe^ajm^Q^ 

functions as the interface with a content provider's website. 65 ualjtflML pjggjnfaniation has been received. Ir^conjunc- 

In a preferred embodiment, it acts as a virtual browser, tio^Mnot^J^^dejets, determined by~ffie~fiasrjtable 112, an 

proxying browser functionality on behalf of the information XSLvtransfonn processor 110 may convert the received 
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XHJ^information to RML . WormatiotftfiS^pfe^td 
to the layout engine 42 so that it can be oonveiteiaTOo^vto 
and protocol specific mark-up language^formats. 
Additionally, the XSL rulesets may permit the Internet 
content provider 13 to control the look and feel of the 
content (e.g., the location and/or inclusion of certain ele- 
ments of the content together on a single card) regardless of 
the wireless device 15 on which the content is being dis- 
played. Now, an example of an HTML page prior to 
conversion, an XSL ruleset that may be used to convert the 
particular HTML page and the resulting RML page in 
accordance with the invention will be described. 

Below is an example of an HTML web page that may be 
converted in accordance with the invention into RML. 
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SAMPLE HTML CODE FOR WEB PAGE 

<html> 
<head? 

<title>Weloome to Foo.com I <Ai tie > 
</bcad> "* v 

<body> v \ 

<p><a hicf-"htlp7Avww.foo.com/login/ T '>Log Io</a></p> \ 

<p><a hrcf="httpVAvww.foo.coin/signup^ , >sign Up<Mx/p> 

<hl align»''centei^>Foojoam</hI> 

<h4 align-"center">Your source for all things foo!</h4> 

<tablc> 
<tr> 

<th colspan»- u 2" align- M center">Foo Products</th> 

</tr> 
<tr> 

<td>Foo Fighter</td> 

<td>$19SS<M> 

<!U> 

<tn> 

<td>Foo Peacemaier</td> 
<td>S29.95</td> 

<Ar> 
<ti> 

<td colspan- , *2* , > 

<a hrcf="http:www.foo.com/buy'*>Buy these 

wonderful Foo's!</a> 

</td> 

</tr> 
<^table> 

<p>(c) 2000 Foo.com </p> 
</body> 
</html> 



</pano 
</navigatiou> 

<group name^'Mainr class="l" scquential=* 4 sequeiitial"> 

<xsl:apply-templat£S sd ccU»**body"/> 
</graup> 
</rml> 
</xsllcmplate> 
10 <Ksl:template match- M body M > 

catomk name«*TMc** class-*l**> 

<b><xslrvahJc-of select-"h47x/b> 
</atomio 

<xsl:apply-tcmplates select»* 4 tablc , 7> 
<frsllemplat£> 
!5 <xsl:template match-"table"> 

cgroup name=*Toy Table" class**" 1"> 

<atomic name~T&h!e Tale" class-" 1" sequenlialo-gequential'^ 

<b><xsl:value-of Kelect-"tr/th7></b > 
</atomic> 

<xsl;for-each select="u(td[iiot(@coLspaD)If"> 
<group name-Toy" class-"l"> 
<»I:for-e«ch se!ect- M td"> 
catomic name«"Eatry"> 

<xsI:value-o£ selcct-".7> 
</atomio 
</xsl:for-each> 
</group> 
<tfsst:for-eacli> 
</group> 
<yxslleinplate> 
</sal stylesheet* 



The resulting RML code in accordance with the invention 
is set forth below: 
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<IDOCTYPE html PUBLIC "-//W3Q/DTD HTML 4.0 Transi- 
tionaI//EN"> 
<nnl> 
<head> 

<titlc> Welcome to Foo.com! <Aiile> 
</bead> 



Now, a XSL ruleset that may be used to convert the above 
HTML page into RML in accordance with the invention is 
set forth below. 



45 



<xsl:stytesheet jmlns3sl»"http ^/www.w3. org/1 999/XSI7Traasform T '> 
<xsl:template niatch-"*IrV<xsl^^y-templatea/>^ial:template> 
<rsl:template malch-neilO(@*"><xsl^alue-of seIecto*\7x/xsl :tcm- 
plate> 

<xsl:template matcb-Titml'V 
<nnl> 

<head> 

<titlexml.-valae-of select-7/tide7>^/title> 
</head> 
<:guide> 

<navigation> 
<pane> 

<xsl:for-each selectVbody/p/a"^ 

<atomic name-"Goto:" class-" 1" 

column«"column"> 

<a hrcfo"{@href}"xxslrvalue-of 

Gelect^ M nonnalize-flpace(.)7>Va> 

</atomio 
<yisl: for -each > 
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<navigation> 
<pane> 

catomic name="Goto:" class*»"l"* coliurm»**oolainn"> 

<a href-"httpyAvww.fbo.coin/logm/'>Log ln</a> 
<^&tomio 

<atomic name»"Goto: > * class-**l" column=«'*cohiinii"> 

<a href~ w http^A | 7ww.C6o.com/signup^'>SigD Up</a> 
<#tfomio 
</pane> 
</navigatioa> 
</guide> 

<group name-^Main" class-**!** sequential="sea ) uentiflr> 
<atomic name-Title "class^'lV 

<b>Your source for all things fooI«tf» 
</atonrio 

•cgroup name=**Iby Table'* class=> M l**> 
otomic name-Table Title** dass-*'l** sequent ial-"scqucntial > *> 

<b>Foo Products </b> 
</atomio 

<group name-Tby Table** class- 4 *! **> 
<atomic name="Entry > '>Foo Fighter </ato mic> 
<atomic niiine=*'Entry">$19.95</atoinic> 

-c/group> 

•cgroup namc=**Tby" dass»T*> 
<atomic name=*'Entry**>Foo Peace maker</atomic> 
otomic name-*'Entry M >$29.95</atoinic> 
</group> 
</group> 
Cgroup > 
</rml> 
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The above RML code may also include the unstructured 
tag as described above that m surrounds a particular portion 
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of the mark up language that is unstructured and may be 
processed using a perl script Now, a graphical example of 
HTML content being converted into the RML in accordance 
with the invention will be described. 

FIG. 9 jga diagram illustrating an HTML page 120 be ing ,| 5 
converted int o an RML page 122 in accordance with the" 



mvejUonTTb'r purposes of the conversion, each piece of 
^isCTete content with presentation structure or no structure 
124 is mapped into an atomic_12 6 in the RML code. It should 
be noted that thd hyperlinks within a piece of content are 
maintained in the atomics. In addition to the atomics, groups 
of content 128 in the HTML page may be converted into a 
group 130 in RML. In this example, the group is non- 
sequential (e.g., the content in the group does not need to be 
displayed sequentially). Now, an example of the division of 
a graphical web page into atomics and groups in accordance 
with the invention and the layout engine 42 will be described 
in more detail. 

FIG. 10 is a diagram of the layout engine 42 in accordance 
with the invention. As described above, the layout engine 42 
formats a content source for a specific device's screen and 
ihb^evX^^^^^^J^ layout engmelEfmay include the" 
content cutter 72, the layout processor 62, the card builder 
68, and the deck builder 70. Tnejconlejp^cutter 
content of fo rmat and conte nL classes not appropriate for the 

^JtotjcjreajejLn 



specific devtejrpj 

XML representation of the rccHvaTbligirM webpage. The 
layout processor 62, using prior knowledge of the device 
type and the content, dynamically devises an optimal layout 
and navigation structure for the particular device 15. Thus, 
the output of the layout engine 42 is a presentation shoe in 
the appropriate presentation protocol for a particular appli- 
ance 15. 

Presentation shoes are built from the bottom-up, resulting 
in the highest priority "atomic" being placed on a card first. 
An atomic is the smallest unit of a web page that encapsu- 
lates an idea. For example, an atomic may be a paragraph of 
text, a heading, a link to a news story, a picture, etc. Atomics 
may be grouped together to reveal relationships between 
them. Groups may be nested to form a complex relational 
hierarchy. These groups can be placed on cards so that 
customized presentation pages can be transmitted to a device 
15. 

In operation , an RML document i s receive d by the layout 
engine 42 from which the content cutter 72 cuts data classes 
th at are not appropriate f or the requesting device 15 Jq^ 
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g enerate an XML document containing the cut content, 
"device mTormation that specifies the target device, and 
protocol information that specifies the target protocol. 

Card creation will be described with reference to FIGS. 
11-13. An example of a portion of an HTML web page 170 
from the E-TRADE website is shown in FIG. 11. In the 
Figure, the innermost dashed boxes designate atomics while 
the boxes enclosing them constitate^^ojjps. At the top 
portion of the page 170 is a quote ippkrupufonn 171. The 
quote look-up form 171 is made up of three atomics, a 
"Quotes" title portion 171c, an entry boK„121^and a "Go" 
submission button 171c. Further, the jnaxtet.^r^b^72a, 
table_172fc, and Fool.com a^ eacjj 
related at5m<»„and ^ 

172. In addition, each elemenrinThe market graph 172a may 
also be an atomic so that "NASDAQ" is an atomic, 
"2756.27* is an atomic, the down arrow is an atomic and 
"-5.4S" is an atomic. Finally, the TheStreetcom logo 173a, 
and the four news stories 173b-e are each related atomics 
and are grouped together as group 173. All of the groups 
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171, 172, 173 make up the root group 170. These groups 
constitute the relational hierarchy for this portion of the 
E-TRADE website. 

To display this website on a display device 15, such as a 
Palm Pilot or a Windows CE device, the groups and atomics 
need to be organized and placed on cards that make up the 
presentation shoe. Cards are created by examining how 
groups best fit onto the cards. A tree data structure can be 
generated from the RML object As described above, nesting 
groups describe the relational context of content contained 
in a webpage. In RML formatting, each group typically has 
three primary attributes: name, class and priority. The name 
attribute is used during a navigation pass through the tree as 
a link description to the group if needed. The class attribute 
is used by the content cutter 72 to determine the content in 
the webpage that is appropriate for a specific device's 
capabilities. Thus, the class attribute allows different levels 
of content to be presented to different classes of devices. For 
example, the general classes of devices are shown in the 
following table, but the number of classes may be increased 
or decreased. 



Class General Device Class 

1 Cellular telephones - very limited screen size 

2 Palm Pilot - low resolution, black and white or monochrome 
display 

3 Windows CE - high resolution, color display 



30 

The priority attribute indicates the importance of each 
portion of the content and is used by the layout generator 42 
during preprocessing to order the content into appropriate 
groups. To further organize the groups, each group also has 
35 Boolean attributes, such as columnize, sequential and other 
attributed such as association. The association attribute may 
have the values of keep together (keep the atomics together), 
isolate (isolate the atomic on its own card) or null. The 
columnize attribute indicates whether an atomic/group 
40 should be placed adjacent another atomic/group on a card. 
The sequential attribute specifies how the navigation struc- 
ture should be designed. For example, atomics containing 
paragraphs of a story would be grouped as sequential 
information. In contrast, a list of links to news stories would 
be grouped as non-sequential information. The keep 
together attribute indicates to the layout engine 42 to keep 
children of a node together on the same card if possible. The 
isolate attribute specifies that the group is be placed onto its 
own card. 

In addition to the above, knowledge of the number of 
characters and pixels horizontally and vertically on the 
target wireless device 15 is desirable. In addition, for devices 
15 that allow fonts, a font width calculation may need to be 
made. A bandwidth and screen dependent variable may be 
assigned to determine how much content is allowed to scroll 
before a new presentation page is created vertically and a 
link created to subsequent presentation pages. Another vari- 
able may store the minimum container width (how narrow 
the text container can be made so that another container can 
be put along side it). 
vU FIG. 12 illustrates the relational structure of the groups in 
the E-TRADE web page shown in FIG. 11 that may be 
represented in a tree structure. This separation of content 
fromstyle™in a relatioloaTway that can be represented in a 
tree allows for theje-formattiog of the content in accordance 
withjyte 

refers to the outermost group of the tree and indicates the 
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E-TRADE web page 170. The nodes 132-186 (roman 
numerals I, II, and III) are the "children" of the root node and 
refer to the atomics 171n, 1716, 171c of the Quotes portion 
171 of the E-TRADE web page 170. A node 188 (roman 
numeral IV), another child of the root node, refers to the 
market group 172 and includes children nodes A-C (market 
graph 172a, tabic 1726 and Fool.com advertisement 172c of 
FIG. U). Similarly, a node 190 (roman numeral V) refers to 
the news section 173 and includes children nodes A-E (the 
news story links 173a-e of FIG. 11). The flexibility of trees 
and directed graphs allows them to represent any structure 
found in a web site. Thus, the trees provide a method to 
organize the complicated relationships amongst content in a 
web page for formatting as well as provide navigation links 
between cards in a presentation shoe. 

To provide a simple tree structure that can be efficiently 
navigated, the tree is sorted by priority. This pre-processing 
reorders the tree to put the content that will appear on the 
device 15 first in the left-most nodes of the tree. After the 
tree has been sorted, the groups and content are evaluated in 
order to assign the content to one or more cards using the 
attributes described above. A depth-first search algorithm as 
described above, using a recursion technique that considers 
outgoing edges of a vertex before any neighbors of the 
vertex, may be used to implement this evaluation as 
described below with reference to FIG. 13. As this search 
evaluates each vertex, it attempts to optimally fit the atomics 
onto cards. 

FIG. 13 is a flowchart illustrating a method 20® for 
realising through the tree generated by the XML engine in 
order to generate customized panes and cards that may be 
displayed on an information appliance or wireless device 15 
in accordance with the invention. The recursive method 
starts with the root node — the top node in the tree (See FIG. 
12). In this case, the root node is the top-level group node 
that encompasses all of the content that shall be placed into 
cards. The goal of the recursion is to place content into cards 
in such a manner that the context of the content remains 
while creating an intelligent manner of accessing the content 
(an intelligent navigation scheme) for each different infor- 
mation appliance or wireless device that may have different 
display capabilities. As described above, all of the content 
resides in atomic nodes and all atomic nodes are children of 
groups. An example of the groups and atomics are illustrated 
in FIG. 11. Groups can contain other groups or atomics 
while atomics can only contain content, for example, 
HTML. 

FIG. 13 is an example of how the recursion method may 
be accomplished but many other implementations are pos- 



is passed to Recuse Atomic step 203. In Recurse Atomic, an 
atomic object is created using the node's contents. This 
Atomic is then placed on to the current pane in step 210. 

There are certain circumstances, such as columnizable 
atomics, where the current pane is added to the card and the 
current atomic is placed in a new pane. A columnizable 
atomic is one that can be placed adjacent to another atomic. 
When all of the children of a group have been passed to 
Recurse Node in step 212, Recurse Group 206 then tries to 
add the current pane to the current card in step 214. If the 
pane fits on the card (the test occurs in step 216), then 
Recurse Group returns in step 218 to Recurse Group 206 and 
the recursion continues. If the pane doesn't fit on the card, 
an exception is handled in Recurse Group where one of two 
15 things can happen. If the pane contains sequential atomics 
(as tested in step 220), then the pane is split, and as many of 
its atomics that can fit on the current card are placed into a 
new pane and put onto the card (steps 222. A link to the next 
card is placed at the end of the full card and then the card is 
placed in the shoe and a new card is created 224. The new 
pane with the remaining cards is put onto the new card in 
step 226 and the recursion continues in step 228. 

If the pane contains nonsequential atomics (as tested in 
step 220), then the current node is cloned (duplicated). Then, 
sequential atomic nodes that are links to the original children 
are created in step 230 and added as children to the original 
node while the original children are removed. The cloned 
node contains the original children. The Recurse Group step 
206 then iterates through these original children by passing 
them to the Recurse Node object 202. If no exceptions occur, 
Recurse Group 206 then iterates through the original node's 
new children. This iteration is handled in the same manner 
as the original iteration through the group's children. Once 
all of the atomics have been placed in panes and these panes 
placed in cards, the recursion should return to the root node 
and the recursion ends. 

FIG. 14 illustrates a collapsing methodology for process- 
ing the tree to create cards that can be transmitted to a device 
15. The recursion of the tree as described above begins at the 
root node and proceeds to its first child node 182 (node I), 
the "Quotes'* text 171a. A new columnizable pane is created 
and atomic I (171a in FIG. U) is added to it. This pane, as 
it is closed, is added to a newly created card. The recursion 
process then continues to the next node 184 (node II), the 
entry box 1716, and adds this atomic to a new columnizable 
pane. Similarly, node ID, the "Go" submission button 171c, 
is added to a new columnizable pane. These panes are passed 
back to the root node and a card 240 is created that includes 
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sible. Recursion through the tree representation of the web so ^ e tnree V^s- 



site begins at the root node. The depth-first search proceeds 
to the first node without any children. This is the first atomic 
having the highest priority. In addition to the previously 
described attributes, two additional attributes are relevant to 
the creation of cards that are placed in the presentation shoe., 
"panes'* and "frames." Multiple atomics can be placed, top 
to bottom, into a pane. Panes can be assigned to cards 
adjacently (columnized) by placing them into two frames 
side-by-side on the card. 

The recursion for the tree starts by passing the root node 
to the Recurse node object 202 which determines what kind 
of node (e.g., a group or an atomic) is being manipulated in 
step 204. In the root node's case, the passed node is a group 
and it is passed to the Recurse Group object 206. The 
Recurse Group object iterates through the node's children, 
passing each of them back to Recurse Node 202. In the case 
where the node Recurse Node receives an atomic, the node 



The recursion process continues to node 188 (node IV), 
the market group 172, and immediately to leaf node A, the 
market graph 172a. A new pane is opened and atomic A is 
added to it. The process continues to leaf node B, the market 
55 table 172b, Atomic B is added to the pane, as is atomic C, 
according to the same processing, and the recursive process 
continues through the tree to node 190 (node V), the "News" 
section 173. A new card 242 is created that includes atomics 
A-C of group IV. Similarly, atomics A-E (news stories 
50 173a-e of FIG. 11) are placed on a third card 80c and the 
recursive process completes. 

The output of the card generation is shown in FIG. 15. 
Each of the cards 240, 242, 244 is shown. As shown in the 
Figure, the first card 240 includes atomics 171a, 171/), 171c 
65 from FIG. 11 wherein each atomic is in an individual pane. 
Tne second card 242 includes atomics 172a, 1726, 172c 
from FIG. 11 all grouped as a single pane on the card 242. 
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The third card 244 includes atomics I73a~e grouped as a 
single pane. Whether atomics may be grouped on a card as 
a single pane or in individual panes or any combination 
thereof depends primarily on the contextual information 
relating to the atomic. 

Navigation amongst these cards is determined by the 
sequential attribute of the root node. Thus, in the non- 
sequential case, a card with links to the three cards 240-244 
shown in FIG. 15 would be created. In contrast, for a 
sequential case, navigation between the cards 240-244 
would begin at the first card 240 and links at the bottom of 
the first and second cards 240, 242 would lead to the next 
successive card (e.g., the second card 242 and the third card 
244) respectively. As mentioned above, the navigation 
between cards can be determined from the tree. 

The recursive process described above was simplified 
because each child of the root node fit on am individual 
page. The following description, with reference to FIGS. 
16-18, illustrates a more complicated card creation process 
in which the dynamic feature of the layout engine 42 are 
shown. 

FIG. 16 illustrates an example of a web page 290 from the 
CityScarch.com website. In the Figure, two different group- 
ings are indicated. The first grouping 291 includes the 
"Review of the Week" contextual information. In the group 
291, four atomics 291a-d are shown that reference the title 
291a, text 291fr, picture 291c and mini review 291d. The 
second grouping 292 includes the "New Releases" contex- 
tual information. In the group 292, there are different num- 
bers of atomics for different sub-groups. Each of the sub- 
groups 293-296 are grouped according to the different 
movies. Atomics within these subgroups represent the 
names 293a, 294a, 295a, 296a of the movies, pictures 2936, 
2946, 2956, 2966 of the movies and mini reviews 293c, 
294c, 295c, 296c of the movies. Sub-group 293 also includes 
another atomic 293d referring to the title "New Releases." 
These groups and sub-groups constitute the relational hier- 
archy for this portion of the CitySearch.com website as 
determined by the XML engine 46 in accordance with the 
invention using the XSL rule set and perl script for the 
particular web page. 

The tree structure for the CitySearch.com web page 290 j 
is illustrated in FIG. 17. In the Figure, the root node 300 
("1") indicates the CitySearch.com website group. Child 
nodes 302, 304 (Nodes 2 and 3) indicate the two groups 
"Review of the Week" 291 and "New Releases" 292. The 
atomics a-d of node 2 refer to the title 291a, text 2916, 
picture 291c and mini review 291rf of the "Review of the 
Week" group 291 of FIG. 16. The child nodes 306-312 of 
node 3 (nodes 4-7) and their associated atomics a-c of the 
tree represent the different movie sub-groups 293-296 
included in the "New Releases" group 292. 

In accordance with the invention, the layout engine 42 
recursively iterates through the relational tree structure, 
d ynaim ic ally building cards foran appropriate screen size of 
a target dma^i5T7^escribed above, the recursive process 
mayjneorporate a depth-first recursive algorit hm to imp le- 
ment prqcKSii^j^ 
begins at nodeJLjtojro^ 
node 2, the "Review of the Week",group 291, to se^jic¥Er 
m e^jigbest priority node th at does not h ave any , associa ted 

291 a of the group 291" Atomic 2a E placed ontoTphTahd 
the pane state is set to non cohimnizable. Next, the recursion 
process proceeds to node 26, the text portion 2916 of the 
group 291. The atomic state of the text portion 2916 is 
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compatible with that of the title portion 291a so the atomic 
2916 is added to the current pane. This process continues, as 
described above, until the entire tree has been processed and 
the atomics have been placed in panes on cards. 
5 Depending on the target device 15 screen size capability, 
different presentation screens may need to be generated by 
the system 10 so that the information may be viewed on the 
device 15. For example, for a Palm Pilot device, the con- 
textual information associated with node 2, the "Review of 
the Week" group 291, may fit entirely on the display screen 
so that the associated atomics may be included on a single 
card. However, the "New Releases" group 292 and its 
associated subgroups 293-296 may not fit on the card and 
new cards may need to be created in order to display this 
information on the Palm Pilot. Since node 3, the "New 
Releases" group 292, is a non-sequential group, new cards 
for each of the subgroups 293-296 may be created and the 
appropriate links may be i nserted into the cards so that a 
vie wer of the information on the device can naviga te 
Dctw £ £ D display pages. An example ot' the presentation 
information of the CitySearch.com web page 290 shown on 
a Palm Pilot device is shown in FIG. ISA. — 
In contrast, for a cellular telephone device that has a 
limited display screen size, a different alternative of display- 
ing presentation information is possible and automatically 
generated by the system. In particular, depending on the size 
of the display screen, the system 10 may determine that 
cards are divided at different points along the creation 
process and the appropriate links inserted thereon so that a 
viewer of the presentation information on the phone device 
can navigate between the different display pages. FIG. 1SB 
shows an example of the presentation information of the 
CitySearch.com web page 290 shown on a cellular tele- 
phone device. In particular, the information displayed on the 
full screen display of the Palm Pilot (See FIG. ISA) are 
broken into one or more smaller screens wherein, for 
example, the user of the phone must select the "Go" button 
to move down through the various titles of the movies that 
are reviewed. 

While the foregoing bas been with reference to a particu- 
lar embodiment of the invention, it will be appreciated by 
those skilled in the art that changes in this embodiment may 
be made without departing from the principles and spirit of 
the invention and defined by the appended claims. 
What is claimed is: 

1. A system for intelligently harvesting information from 
a data source for one or more different information appli- 
ances having different input/output formatting capabilities, 
comprising: 

means for receiving web-based content information of a 

first input/output format; 
means for translating the received content information 
from the first input/output format to a different input/ 
output format that is recognizable by a specific device; 
and 

means for providing the translated content information to 
the device. 

2. The system of claim 1 wherein the device is a wireless 
device having a predetermined display screen size. 

3. The system of claim 1 further comprising means for 
receiving information from a device that indicates the input/ 
output format of the device so that the translated content is 
generated based on the input/output format of the device. 

4. A system for translating content information from a first 
formatting language to a second formatting language, com- 
prising: 
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a host system including content information of a first guage can be generated from the remaining content infor- 

fonnatting language; mation of the second intermediate language; a layout pro- 

a translation server remotely connected with the host cesser for dynamically generating a layout format for the 

system, the translation server configured to receive the content information of the second formatting language that 

content information of a first formatting language and 5 is optimized for the display screen size of the device; and a 

translate the received content information from the first protocol processor for generating the content information of 

formatting language to a second formatting language; ™ e second formatting language. 

and 17. The system of claim 16, wherein the layout format 

a device remotely connected with the translation server, comprises a presentation shoe that includes at least one 
the device configured to receive the content informa- 10 P^ 0 ^ 011 card, each presentation card representing a 

tion of the second formatting language and display the d *? U ? pa * c °. f tbc cach presentation card including 

content information a * least a portion of the content information of the second 

3. The system of claim 4, wherein the device is a wireless ^T^^^' tlT* ^^T^ ^T^™ T ^ 

, „ A . . j j • 1 . in the presentation shoe are linked so that the content 

device having a predetermined dispky screen size information of the second formatting language can be selec- 

6. The system of claim 4, wherein the host system 15 tively oa ^ device 6 5 * 
comprises an Internet web server that includes at least one 18 . system of claim 17> wherein me prese ntation 
web page thereon, the web page having a specific URL cards include at least one pane portion thereon and wherein 
Internet address and including the content information of the the content information is separated into atomic groups of 
first formatting language. content information, the atomic groups of content informa- 

7. The system of claim 4, wherein the translation server 20 tion being organized and assigned to the pane portions of the 
comprises means for establishing a communications session presentation card so that formatting of the content informa- 
with the host system so that information content of the first tion can be optimized for the display screen on the device, 
formatting language can be received by the translation 19. The system of claim 18, wherein the presentation 
server; means for receiving the information content of the cards in the presentation shoe are transmitted to the device 
first formatting language; means for converting the infor- 2s 50 tnat me content information of the second formatting 
mation content of the first formatting language to informa- language can be displayed on the device. 

tion content of an intermediate formatting language rccog- 20 * Tnc system of claim 7, wherein the device formatting 

nized by the translation server, means for translating the means further comprises means for receiving information 

information content of the intermediate formatting language from a device mat indicates me input/output format of the 

to information content information of the second formatting 30 device 311(1 means for formatting the content information of 

language; means for formatting the content information of !° e second fonnatting language based on the received device 

the second formatting language so that the content informa- information. 

tion can be transmitted to and displayed on the device. 21 - Tne svstem of d*im 4, wherein the translation server 

8. The system of claim 7 wherein the information content former comprises a content connection handler. 

of the first formatting language comprises HTML. 35 22 - ^ system of claim 21, wherein the content connec- 

9. The system of claim 7, wherein the intermediate Uon handlcr comprises a client connection module config- 
formatting language is XHTML. urcd to esta °li sn and maintain a communications session 

10. The system of claim 7, wherein the translation means ^ ^ host svstem » a data processor configured to receive 
comprises an XML engine. me c° ntent information of the first formatting language from 

11. The system of claim 10, wherein the XML engine 40 me host a cleanup module configured to convert the 
comprises a hashtable module for comparing the URL content information of the first formatting language to 
Internet address of the received content information with a content information of the intermediate formatting lan- 
group of predetermined rulesets that define a criteria for SPW means for translating the information content of the 
translating the content information to the second formatting intermediate formatting language to information content 
language; and a transform processor configured to convert 45 uaformat * ,n of mc second formatting language; and means 
the received content information of the intermediate format- for fo™»tfng the content information of the second for- 
ting language to content information of a second intermc- matting language so that the content information can be 
diate formatting language. transmitted to and displayed on the device. 

12. The system of claim 11, wherein the second interme- 23 • A method for intelligently harvesting information 
diate formatting language comprises RML. 50 from a **** for one or m °re different information 

13. The system of claim 11, wherein the group of prede- appliances having different input/output formatting 
termined rulesets define a branding criteria for the content capabilities, comprising: 

information of the second formatting language so that the receiving content information of the first formatting lan- 
content information of the second formatting language dis- guage from an Internet web server; 
played on the device is similar in appearance to that of the 55 converting the received content information of the first 
content information of the first formatting language received formatting language to an intermediate formatting lan- 
from the webpage. guage so that the received content information can be 

14. The system of claim 13, wherein the group of prede- processed and formatted in accordance with a display 
termined rulesets comprise XSL rulesets defined by the host capability of a device; 

system. 60 translating the processed content information from the 

15. The system of claim 7, wherein the formatting means intermediate formatting language to content informa- 
comprises a layout engine. tion of the second formatting language; and 

16. The system of claim 13, wherein the layout engine trarismitting the content information of the second for- 
com prises a content cutter for extracting portions of the matting language to the device so that the content 
content information of the second intermediate language that 65 information can be displayed on the device. 

are not compatible with a display capability of the device so 24. The method of claim 23 further comprising receiving 

that the content information of the second formatting Ian- information from a device that indicates the input/output 
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format of the device so that the translated content is gener- 38. The method of claim 37, wherein the presentation 

ated based on the input/output format of the device. cards in the presentation shoe are transmitted to the device 

23. The method of claim 23, wherein the device is a so that the content information of the second formatting 

wireless device having a predetermined display screen size. language can be displayed on the device. 

26. The method of claim 23, wherein the receiving further s 39. A system for translating content information from a 
comprises establishing a communications session with the ^ l formatting language to a second formatting language so 
host system so that information content of the first format- mat ^ content information can be displayed on a device, 
ting language can be received by the translator. comprising: 

27. The method of claim 26, wherein the information a hash table module for comparing a URL Internet address 
content of the first formatting language comprises HTML. 10 of a web P a 8 e including the content information of the 

28. The method of claim 26, wherein the intermediate first formatting language with a group of predetermined 
formatting language is XHTML. rulesets that define a criteria for translating the content 

29. The method of claim 23, wherein the translation information to an intermediate formatting language to 
further comprises performing operations with an XML generate content information of the intermediate for- 
engine. is mailing language, 

30. The method of claim 29, wherein the performing 3 tran f 0 ™ P roce *f r a ^& 1K < 1 10 convert the received 

~~»-o*; o „ -tu vni>rr ~: n - content information of the intermediate formatting Ian- 

^TTf^T^I^^?^ ^ ^ T" 5 ? < ? mP f ar " «W to content information of a second intermediate 

™t£ ™ f of the received content infor- fornTatting language to generate content in the second 

matron with a group of predetermined rulesets using a formatting intermediate language; and 

hasbteble module wherein the rulesets define a criteria for 20 a j t tD ^ m ^gutd translate the content infor- 

translatmg the content information to the second formatting mation t0 me formatting langu from the 

language; and converting the received content information second intermediate formatting language and to format 

of the intermediate formatting language to content informa- me content information for display on the device, 

tion of a second intermediate formatting language using a 4®. The system of claim 39, wherein the layout engine 

transform processor. 25 comprises a content cutter for extracting portions of the 

31. The method of claim 30, wherein the second inter- content information of the second intermediate language that 
mediate formatting language comprises RML. are not compatible with a display capability of the device so 

32. The method of claim 30, wherein the group of that the content information of the second formatting lan- 
predetermined rulesets define a branding criteria for the guage can be generated from the remaining content infor- 
content information of the second formatting language so 30 mation of the second intermediate language; a layout pro- 
that the content information of the second formatting lan- cessor for dynamically generating a layout format for the 
guage displayed on the device is similar in appearance to content information of the second formatting language that 
that of the content information of the first formatting lan- * op^n* 2 ^ fo * m e display screen size of the device; and a 
guage received from the webpage. protocol processor for generating the content information of 

33. The method of claim 32, wherein the group of 35 J 6 ?!? 1 formattin g 

predetermined rulesets comprise XSL rulesets defined by the 41 " V 1 * system of claim u 4() ' wherem the layout format 

host system comprises a presentation shoe that includes at least one 

34. The method of claim 23, wherein the formatting P^ ntatioa ^ Potation card representing a 
,™ • _ i * • * display page of the device, each presentation card including 
compr^usmgakyout en^e at ^ f^on of the Wmation of me second 

35. The method of claim 34 wherein the layout engine 40 formatting language, wherem subsequent presentation cards 
comprises extracting portions of the content information of m the presentation shoe are linked so that the content 
the second intermediate language using a content cutter that information of the second formatting language can be selec- 
are not compatible with a display capability of the device so tivefy viewed on the device. 

that the content information of the second formatting lan- 42. Hie system of claim 41, wherein the presentation 

guage can be generated from the remaining content infor- 45 cards include at least one pane portion thereon and wherein 

mation of the second intermediate language; dynamically the content information is separated into atomic groups of 

generating a layout format for the content information of the content information, the atomic groups of content informa- 

sccond formatting language that is optimized for the display tion being organized and assigned to the pane portions of the 

screen size of the device using a layout processor; and presentation card so that formatting of the content informa- 

generating the content information of the second formatting 50 tion can be optimized for the display screen on the device, 

language using a protocol processor. 43. The system of claim 42, wherein the presentation 

36. The method of claim 35, wherein the layout format- cards in the presentation shoe are transmitted to the device 
ting comprises a presentation shoe that includes at least one so that the content information of the second formatting 
presentation card, each presentation card representing a language can be displayed on the device. 

display page of the device, each presentation card including 55 44. The system of claim 39, wherein the group of prede- 
at least a portion of the content information of the second termined rulesets define a branding criteria for the content 
formatting language, wherein subsequent presentation cards information of the second formatting language so that the 
in the presentation shoe are linked so that the content content information of the second formatting language dis- 
information of the second formatting language can be selec- played on the device is substantially similar in appearance to 
lively viewed on the device. eo the content information of the first formatting language 

37. The method of claim 36, wherein the presentation received from the webpage. 

cards include at least one pane portion thereon and wherein 45. The system of claim 44, wherein the group of prede- 

the content information is separated into atomic groups of termined rulesets comprise XSL rulesets defined by the host 

content information, the atomic groups of content informa- system. 

tion being organized and assigned to the pane portions of the 65 46. A method for intelligently harvesting information 

presentation card so that formatting of the content informa- from a data source for display on one or more different 

tion can be optimized for the display screen on the device. information appliances, comprising: 
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